Bidiagonalization and symmetric tridiagonalization by systolic arrays

نویسنده

  • R. Schreiber
چکیده

We give a systolic algorithm and array for bidiagonalization of an n x n matrix in O(nlog, n) time, using O(n2) cells. Bandedness of the input matrix may be effectively exploited. If the matrix is banded, with p nonzero subdiagonals and q nonzero superdiagonais, then 4n In(p + q) + O(n) clocks and 2n(p + q ) + O((p + q)’ + n) cells are needed. This is faster than the best previously reported result by the factor log, e = 1.44.. .. Moreover, in contrast to earlier systolic designs, which require the matrix to be preloaded into the array and the result matrix extracted after bidiagonalization, the present arrays are pipelined. Work reported herein was supported in part by Cooperative Agreement NCC 2-387 between the National Aeronautics and Space Administration (NASA) and the Universities Space Research Association (USRA) and pmally supported by Office of Naval Research (ONR) under Contract NOW 1486-K-06 10. Bidiagonalization and Symmetric Tridiagonahation by Systolic Arrays* Robert Schreibert November 29, 1988 Abstract We give a systolic algorithm and array for bidiagonalization of an n x n matrix in O ( n log, n) time, using O(nZ) cells. Bandedness of the input matrix may be effectively exploited. If the matrix is banded, with p nonzero subdiagonals and q nonzero superdiagonals, then 4n In(p + q) + O(n) clocks aad 2n(p -+ q ) + O( ( p + q)2 + n) cells are needed. This is faster than the best previously reported result by the factor log, e = 2-44 0. Moreover, in contrast to earlier systolic designs, which require the matrix to be preloaded into the array and the result matrix extracted after bidiagonaiization, the present arrays are pipelindWe give a systolic algorithm and array for bidiagonalization of an n x n matrix in O ( n log, n) time, using O(nZ) cells. Bandedness of the input matrix may be effectively exploited. If the matrix is banded, with p nonzero subdiagonals and q nonzero superdiagonals, then 4n In(p + q) + O(n) clocks aad 2n(p -+ q ) + O( ( p + q)2 + n) cells are needed. This is faster than the best previously reported result by the factor log, e = 2-44 0. Moreover, in contrast to earlier systolic designs, which require the matrix to be preloaded into the array and the result matrix extracted after bidiagonaiization, the present arrays are pipelind

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tikhonov regularization of large symmetric problems

Many popular solution methods for large discrete ill-posed problems are based on Tikhonov regularization and compute a partial Lanczos bidiagonalization of the matrix. The computational effort required by these methods is not reduced significantly when the matrix of the discrete ill-posed problem, rather than being a general nonsymmetric matrix, is symmetric and possibly indefinite. This paper ...

متن کامل

Solution of eigenvalue problems on heterogeneous computing architectures

In this paper are presented current achievements and the state-of-the-art algorithms and implementations for dense linear algebra on traditional architectures such as single-core machines or distributed memory parallel machines. Also, this paper summarizes the current implementations and publicly available libraries for basic linear algebra for multi-core and many-core architectures (e.g. graph...

متن کامل

اثرالکترونهشت تناوبی نامتقارن در محلول‌های با pH متفاوت برساختار بلوری و خواص مغناطیسی نانو سیم‌های Co‏

Co nanowires were fabricated onto the nanoporous aluminum oxide arrays using ac electrodeposition technique. The influence of frequency and pH on the magnetic properties and crystal structure of symmetric (18-18 V) and non-symmetric (18-12 V) electrodeposited Co nanowires was investigated. A direct relation was seen between saturation magnetization, squareness and coercivity of the samples prep...

متن کامل

High Performance Computing in Material Sciences Higher Level Blas in Symmetric Eigensolvers High Performance Computing in Material Sciences Higher Level Blas in Symmetric Eigensolvers

In this report a way to apply high level Blas to the tridiagonalization process of a symmetric matrix A is investigated. Tridiagonalization is a very important and work-intensive preprocessing step in eigenvalue computations. It also arises as a very central part of the material sciences code Wien 97 (Blaha et al. [12]). After illustrating the drawbacks and limitations of the tridiagonalization...

متن کامل

Tridiagonalization of a symmetric matrix on a square array of mesh-connected processors

A parallel algorithm for transforming an n × n symmetric matrix to tridiagonal form is described. The algorithm implements Givens rotations on a square array of n × n processors in such a way that the transformation can be performed in time O(n log n). The processors require only nearest-neighbor communication. The reduction to tridiagonal form could be the first step in the parallel solution o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • VLSI Signal Processing

دوره 1  شماره 

صفحات  -

تاریخ انتشار 1990